Open
Conversation
Signed-off-by: Alex Chi <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
* test:add a test case to cover week_1_day_3_task3 Closes: #23 Signed-off-by: Jiawei Zhao <Phoenix500526@163.com> * fmt Signed-off-by: Alex Chi Z <iskyzh@gmail.com> --------- Signed-off-by: Jiawei Zhao <Phoenix500526@163.com> Signed-off-by: Alex Chi Z <iskyzh@gmail.com> Co-authored-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Add KV cache module imports to both tiny_llm and tiny_llm_ref packages to enable KV cache functionality. Include comprehensive test suite for week 2 day 1 covering embedding operations, model inference with KV cache, and sequential token generation with offset support. - Add KV cache imports to __init__.py files - Create test_week_2_day_1.py with task 2-4 test coverage - Support multiple Qwen2 model variants (0.5B, 1.5B, 7B) - Include embedding call and as_linear functionality tests - Add sequential generation tests with proper cache management
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Extract string replacement operation outside f-string expression
to avoid backslash in f-string expression part, which is not
allowed in Python syntax.
- Move .replace('\n', ' ') operation to separate variable
- Improves code readability and fixes SyntaxError
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Co-authored-by: Yangchen Ye <yangchenye@Yangchens-MacBook-Pro.local>
* docs: add instruction to download Qwen2-1.5B model
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Extract newline character to a variable to avoid backslash in f-string expression part. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>
Signed-off-by: KKKZOZ <kkkzoz@qq.com>
- Add complete quantized_matmul_impl_typed template function for CPU, which support float16, float32, and bfloat16 data types - Add float32 test cases for quantized_matmul - Adjust float32 tolerance in test utils for better precision
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
* docs: add Week 2 Day 2-3 Quantized Matmul chapter - Add quantized matmul documentation (week2-02-quantized-matmul.md) Signed-off-by: Connor1996 <zbk602423539@gmail.com>
* docs: add week2 quantized matmul GPU part Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
bugfix: way to get_kernel from library
Signed-off-by: you06 <you1474600@gmail.com>
book: replace `huggingface-cli` with `hf`
* docs: add week2 flash-attention chapter links and draft Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
Signed-off-by: Connor1996 <zbk602423539@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
I just use os library to check whether it is mlx or pytorch and there is a specific function and test for each case.